Image decomposition with anisotropic diffusion applied to leaf-texture analysis 



Bruno Brandoli Machado, Wesley Nunes Goncalves, Odemir Martinez Bruno 
Physics Institute of Sao Carlos (IFSC) 
University of Sao Paulo (USP) 
Av. Trabalhador Sao-carlense, 400 
Cx. Postal 369 - Sao Carlos - SP - Brasil 
wnunes@ursa.ifsc.usp.br, brandoli@icmc.usp.br, bruno@ifsc.usp.br 



Abstract 

Texture analysis is an important field of investigation 
that has received a great deal of interest from computer vi- 
sion community. In this paper, we propose a novel approach 
for texture modeling based on partial differential equation 
(PDE). Each image f is decomposed into a family of de- 
rived sub-images, f is split into the u component, obtained 
with anisotropic diffusion, and the v component which is 
calculated by the difference between the original image and 
the u component. After enhancing the texture attribute v 
of the image, Gabor features are computed as descriptors. 
We validate the proposed approach on two texture datasets 
with high variability. We also evaluate our approach on an 
important real-world application: leaf-texture analysis. Ex- 
perimental results indicate that our approach can be used 
to produce higher classification rates and can be success- 
fully employed for different texture applications. 



1. Introduction 

Texture plays an important role in pattern recognition 
and computer vision. Applications with textures are found 
in several areas, including remote sensing [7] and plant leaf 
identification [3 1. Though texture is easily perceived by hu- 
mans, it has no precise definition due to its spatial distri- 
bution. In addition, physical surface properties produce dis- 
tinct texture patterns. Thus the lack of a formal definition of 
texture is reflected into different methods for texture analy- 
sis. 

Many methods for texture description have been pro- 
posed in the literature [24|. They are based on statisti- 
cal analysis of the spatial distribution (e.g., co-occurrence 
matrices lfT4l [131 and local binary pattern ||T5ll ), stochas- 
tic models (e.g., Markov random fields Q), spectral anal- 
ysis (e.g., Fourier descriptors [fl~|, Gabor filters [12| and 



wavelets transform [10]), structural models (e.g., math- 
ematical morphology ll22l and geometrical analysis ||8ll), 
complexity analysis (e.g., fractal dimension l20l [6] |2)), 
agent-based model (e.g., deterministic tourist walk |4|). De- 
spite there are effective texture methods, few papers are 
concerned in enhancing the richness of the texture attribute 
before computing features. 

Inspired by biological vision studies, the community of 
computer vision has also shown a great deal in represent- 
ing images using multiple scales. The basic idea is to de- 
compose the original image into a family of derived im- 
ages |Tl8l l23l . The decomposition is obtained by convolv- 
ing the original image with an image operator, for example, 
a simple way is to employ Gaussian kernels. Although the 
Gaussian filtering satisfies the heat equation, its derivatives 
cause spatial distortion in region boundaries. It implies that 
the diffusion process is equally in all directions, that is, the 
diffusion is linear or isotropic. On the other hand, Perona 
and Malik formulate a new concept that modified the lin- 
ear scale-space paradigm to smooth within a region while 
preserving edges. 

Due to the increasing interest in image analysis, we pro- 
pose a novel framework to model textures. In the proposed 
approach, image decomposition using anisotropic diffusion 
of Perona and Malik is performed before feature extraction. 
The anisotropic diffusion process is mathematically mod- 
eled by partial differential equations (PDEs). The decompo- 
sition is applied to extract the texture component, obtained 
by the difference between the original image and cartoon 
approximations. Then, Gabor filters are used to extract fea- 
tures from the texture component, which presents more en- 
hanced structures. 

The remaining of this paper is organized as follows. Sec- 
tion [2] presents background information on nonlinear dif- 
fusion and Gabor filters. Section [3] details our approach in 
texture analysis. Section [4] presents the results of the ex- 
periments performed on two benchmark texture datasets. 
Finally, conclusions and directions for future research are 



given in Section|6] 

2. Background 

In general, texture analysis is studied into five groups: 
(1) synthesis, (2) segmentation, (3) shape from texture, (4) 
compression and (5) classification. All groups have been in- 
fluenced by the use of decomposition and filter banks. Next 
we describe image decomposition using anisotropic diffu- 
sion and Gabor filters. 

2.1. Anisotropic Diffusion 

Scale-space theory has been investigated for represent- 
ing image structures at multiple scales. The idea is to de- 
compose the initial image into a family of derived images. 
According to ||23l and lfl6l , a family of derived images may 
be viewed as the solution of the heat equation and described 
using partial differential equations (PDEs). The successful 
use of PDEs in image analysis is assigned to the power to 
model many dynamic phenomenon, including diffusion. A 
new paradigm of nonlinear PDEs for image enhancement 
was introduced by Perona and Malik |2~T1 . Their formula- 
tion, called anisotropic diffusion, uses a nonlinear scheme 
that smoothes images by creating cartoon approximations, 
while the region boundaries remain sharp. Formally, the dis- 
crete formulation of Perona-Malik is defined as: 

lif = ll 3 + [c N .V N I + c s .V s I + c E .V E I + c w .V w I]\ 

(1)' 

where < A < 1/4 is a scalar that controls the numeri- 
cal stability, VI is the gradient magnitude, c is a constant 
value for the conduction coefficient, N, S, E and W are the 
mnemonic subscripts for North, South, East and West. The 
PDE equation above can be write as follows ((i, j) = s): 

= It + T E 9(VIs, P )VI s , p (2) 

where /* is the cartoon approximation image, t denotes 
the number of iterations, s denotes the pixel position, £ s 
represents the number of neighbors of pixel s (usually 4- 
connectivity), and g(VI) is the conduction function. The 
value of the gradient is computed by linearly approximat- 
ing its norm in a specific direction as: 

Vl s , p = l p -lt,peZ s (3) 

Perona and Malik proposed two functions of diffusion: 

fl (||W||)= e -(l™^ 2 (4) 



and 

The parameter K controls the conduction. The first equa- 
tion favours high contrast edges over low contrast ones, 
while the latter favours wide regions over smaller ones. Al- 
though Perona and Malik proposed two different functions, 
the smoothed images are quite similar. 

A texture decomposition with anisotropic diffusion is 
shown in Figure [T] The first row shows the family of car- 
toon approximations from the original image Iq- We can ob- 
serve that the information is gradually smoothed, while tex- 
tures, in third row, are enhanced by the difference between 
the original image and cartoon approximations. The solu- 
tion of the heat diffusion is depicted in rows 2 and 4. Note 
that the distribution of heat correponds to gray values in the 
image and the diffusion time is represented by the number 
of iterations t. For different scales t we obtain different lev- 
els of smoothing, as shown from t\ (Figure[TJb)) to £5 (Fig- 
ure[Tjf)). 

2.2. Gabor Filters 

A Gabor filter is a signal sinusoidal plane wave modu- 
lated by a Gaussian 1121 . The filters used in image decom- 
position are created from a "mother" Gabor function of two 
dimensions, for a given space g(x, y) and frequency G(x, y) 
domains. Given the "mother" function, a bank of Gabor fil- 
ters can be obtained in the g(x, y) space domain from oper- 
ations of dilatations and rotations. 

Initially, the Gabor technique generates a filter bank 
gmn{x, y) for different scales m = 1, . . . , K and orienta- 
tions n = 1, . . . , S parameters. Texture features are com- 
puted by convolving the original image / with the Gabor 
filter bank, as depicted in Equation ([6j. By tunning the val- 
ues of m and n, some aspects of the image's underlying tex- 
ture structure can be captured. In this work, a number of 40 
Gabor features have been computed (8 orientations and 5 
scales). 

(x,y) = I(x,y) * ffrnn (x,y) (6) 

The feature vector i\) = [En, E\i, . . . , Eks] is finally 
obtained by computing the energy of the filtered images ac- 
cording to the Equation 

E mn = y^[c mn (x, y)} 2 (7) 
x,y 

3. An Approach to Texture Analysis 

A widely strategy used to compute texture features with 
Gabor is to construct a bank of filters with different scales 
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Figure 1. The essential idea with a scale-space representation of a image is to create a family of car- 
toon approximations. This figure shows an initial image A, (a) that has been successively smoothed 
with anisotropic diffusion [(b)-(f)]. The family of derived images may be viewed as the solution of 
the heat conduction, depicted in rows 2 and 4. The third row corresponds to the texture compo- 
nent. 



and orientations parameters. For each Gabor space is ex- 
tracted statistical measures, such as energy and entropy. In- 
stead of obtaining right the Gabor space, an original im- 
age (/) is decomposed in a set of derived images with 
anisotropic diffusion of Perona and Malik, described in Sec- 
This procedure is executed with several levels of 



2.1 



tion 

decomposition (t) in order to evidence high frequencies 
(v), while it preserves important structures such as edges. 
At each level (t), we obtain two components: cartoon ap- 
proximation (u) and texture (v). The texture component is 
achieved by subtracting the original image and the cartoon 
approximation. An example of image decomposition using 
anisotropic diffusion is shonw in Figure [2] 



The filtering process aim at evidencing high frequencies 
in the image in order to produce richer representations. Per- 
ona and Malik filtering overcomes the main restriction im- 
posed by linear approaches, i.e., blur in region boundaries 
does not occur. The set of texture images v is then used 
to extract Gabor features and useful for a variety of tasks, 
for example, texture classification. The diagram of Figure [3] 
summarizes the approach proposed here. 



4. Experimental Evaluation 

In order to evaluate our approach, experiments are per- 
formed on two image datasets. First, the datasets used for 
evaluation are described. Then, implementation details of 
the descriptors and classifiers are discussed. Finally, the re- 
sults are shown. 

4.1. Datasets 

The Brodatz album [5| is the most known benchmark 
for evaluating texture methods. Each class is composed by 
one image divided into nine new samples non-overlapped. 
These images have 200 x 200 pixels with 256 gray levels. 
A total of 100 texture classes with 10 images per class was 
used. Recently, the Brodatz dataset has been criticized for 
certain weaknesses, including lack of viewpoint and scale 
variation, and illumination changes. Thus, we also use the 
Vistex dataset. 

The Vision Texture dataset [17| (or Vistex) contains 
a large set of natural colorful textures taken under several 
scale and illumination conditions. In addition, images are 
acquired with different cameras. For this dataset we use a 
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Figure 2. An example of image decomposition for the Barbara image (a). At each level of decomposi- 
tion, it is generated a cartoon approximation u and a texture component v. v is obtained by subtract- 
ing the original image and the cartoon approximation. 
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Figure 3. Our approach for texture analysis. 



total of 50 texture classes in gray scale. The size of the orig- 
inal images was 512 x 512, but we use the same number 
of samples as fl9l . Each texture were split into 128 x 128 
pixel images, with 16 sub-samples per class, totalizing 800 
images. 



4.2. Performance Evaluation 

In the experiments, we compute the energy of Gabor fil- 
ters with 8 orientations and 5 scales, resulting a feature vec- 
tor with 40 dimensions. We adopt the K nearest-neighbor 
(K-NN) classifier, since it is a good reference classification 
method in the texture recognition. A initial value of K = 5 
is used, with 10-fold cross validation and Euclidean similar- 
ity measure. Here, we change the levels of decomposition t 
of the anisotropic diffusion process (scales). The decompo- 
sition ranges from 10 to 200. The approach is evaluated us- 
ing two texture datasets. 

Experiment 1: First, we evaluate our approach on the Bro- 
datz dataset and compare it to the original Gabor features. 
Features are computed with different levels of decomposi- 
tion t. Figure |4] shows the classification rates in the y axis, 
while the levels of decomposition are indicated in the x axis. 
It can be observed that enhanced texture component (v), ex- 
tracted using our approach, performs better than the origi- 
nal Gabor method. The highest classification rate (t = 40) 
is 94.29% for texture (v) and 91% for the original Gabor, 
respectively. Note that the performance of the cartoon ap- 
proximations (u) get worst at each level of decomposition, 
which confirms our hypothesis that the component u can be 
discarded in order to improve the classification rate. 

Experiment 2: In this experiment we evaluate our approach 
on the Vistex dataset. The setting for this experiment is the 
same as the previous one. In Figure|5] the classification rates 
are presented in the y axis, while the decompositions are 
presented in the x axis. Our approach achieves the best per- 
formance with 88.96% (t = 140) against 83.66% for origi- 
nal Gabor. It is worth noting that the classification rates for 
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Table 1. Comparison of different values of 
nearest neighbors on both datasets. 



Figure 4. Comparison of different scales on 
the Brodatz dataset. 



the cartoon (u) component reduce at each iteration. This is 
associated to the gradual decomposition on the image. 
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Figure 5. Comparison of different scales on 
the Vistex dataset. 



refer to ifTTI for more details). For all operators, the same 
procedure of the proposed approach was performed. In this 
setting, our approach achieved the highest classification 
rates for all values of K on both datasets. For the Brodatz 
dataset, an improvement of 3.14% compared to the Gaus- 
sian operator was obtained using K = 5. On the Vistex 
dataset with K = 5, our approach achieved a classification 
rate of 88.96%, which is significantly better than the classi- 
fication rate of 83.63% achieved by the LoG operator. Ex- 
perimental results demonstrate that our approach is an ef- 
fective representation for texture modeling. 



Dataset 


Operator + Gabor 


%(3-NN) 
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Vistex 


Gaussian 


85.14 


82.75 


81.72 
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84.56 


82.71 


81.05 




LoG 


85.24 


83.63 


82.11 




Our approach 


89.21 


88.96 


86.65 



Table 2. Comparison of different image oper- 
ators on both datasets. 



Table [TJ presents the average and standard deviation in 
terms of classification rates. It also shows results for K = 
{3, 5, 7} on the original image (/), cartoon approximation 
(u) and texture (v). As we can see, our approach using the 
texture component (v) outperforms the others for all val- 
ues of K on both datasets. Interesting results came out from 
the cartoon approximation experiments, which is discarded 
in the proposed approach. A classification rate of 67.80% 
and 31.16% are obtained on the Brodatz dataset and Vis- 
tex dataset, respectively. It clearly shows the poor classifi- 
cation power using the cartoon approximation. 

To illustrate the potential of our approach, we compare it 
with three representative operators used for filtering edges: 
Gaussian, Laplacian and Laplacian of Gaussian (LoG) (we 



5. Leaf- Texture Enhancement: A Case Study 

Although there exist some tools interested in identifying 
plant species, amost none of them are concerned in enhanc- 
ing the texture attribute before computing features from im- 
ages. Here, we show a case study using a subset of five 
classes, with 10 images per class. One example of each class 
is shown in Figure [6] Again our approach achieved highest 
classification rates, according to Table [3] The results show 
that our approach is consistent, being a useful method to en- 
hance the texture attribute employed in real-world applica- 
tions. 




Figure 6. Leaf samples. 
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LoG 


74.20 
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86.00 


80.60 


76.20 



Table 3. Comparison of different values of 
nearest neighbors on the leaf dataset. 



6. Conclusions 

This paper proposed a new approach to enhance the rich- 
ness of the texture attribute by applying anisotropic diffu- 
sion as an early step in the texture image modeling. We have 
also demonstrated how the Gabor process can be improved 
by using our approach. Promising results have been ob- 
tained on two databases of high complexity. In the Brodatz 
dataset, experimental results indicate that the proposed ap- 
proach improves classification rate from 89.04% to 92.87% 
over the traditional approach. In addition, experimental re- 
sults on Vistex dataset demonstrated that the proposed ap- 
proach provides an improvement of 5.30% on classifica- 
tion rate. Our approach is able to successfully handle a wide 
range of texture methods, e.g. from Gabor filters to Markov 
random fields. In order to evaluate our approach, we per- 
formed it to enhance leaf-texture textures wide used in sys- 
tems of plant leaf identification. As part of the future work, 
we plan to focus on investigating new nonlinear PDEs and 
texture image methods. 
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